Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection.
نویسندگان
چکیده
Massively Parallel Signature Sequencing (MPSS) is a high-throughput counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression (SAGE) and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different strains of Salmonella obtained using the MPSS technology. In this article, we develop an exact ANOVA type model for this count data using a zero-inflated Poisson (ZIP) distribution, different from existing methods that assume continuous densities. We adopt two Bayesian hierarchical models-one parametric and the other semiparametric with a Dirichlet process prior that has the ability to "borrow strength" across related signatures, where a signature is a specific arrangement of the nucleotides, usually 16-21 base-pairs long. We utilize the discreteness of Dirichlet process prior to cluster signatures that exhibit similar differential expression profiles. Tests for differential expression are carried out using non-parametric approaches, while controlling the false discovery rate. We identify several differentially expressed genes that have important biological significance and conclude with a summary of the biological discoveries.
منابع مشابه
Role of SPI-1 Secreted Effectors in Acute Bovine Response to Salmonella enterica Serovar Typhimurium: A Systems Biology Analysis Approach
Salmonella enterica Serovar Typhimurium (S. Typhimurium) causes enterocolitis with diarrhea and polymorphonuclear cell (PMN) influx into the intestinal mucosa in humans and calves. The Salmonella Type III Secretion System (T3SS) encoded at Pathogenicity Island I translocates Salmonella effector proteins SipA, SopA, SopB, SopD, and SopE2 into epithelial cells and is required for induction of dia...
متن کاملRNA-Seq Bayesian Network Exploration of Immune System in Bovine
Background: The stress is one of main factors effects on production system. Several factors (both genetic and environmental elements) regulate immune response to stress. Objectives: In order to determine the major immune system regulatory genes underlying stress responses, a learning Bayesian network approach for those regulatory genes was applied to RNA-...
متن کاملThe modeling of body's immune system using Bayesian Networks
In this paper, the urinary infection, that is a common symptom of the decline of the immune system, is discussed based on the well-known algorithms in machine learning, such as Bayesian networks in both Markov and tree structures. A large scale sampling has been executed to evaluate the performance of Bayesian network algorithm. A number of 4052 samples wereobtained from the database of the Tak...
متن کاملP-127: Characterization of Filia, A Maternal Effect Gene, in Bovine Oocytes and Embryos
Background: Genetic analysis in mice has lead to find about maternal effect genes such as Filia. Filia knock out mice have a 50% decrease in fertility. Filia dysfunction causes disorders in pre-implantation development. Mutations in human Filia gene, cause FBHM (Familial Biparental Hydatidiform Mole) in women. Filia protein in mice is homologous to that of rat and human, so this idea has emerge...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of the American Statistical Association
دوره 105 491 شماره
صفحات -
تاریخ انتشار 2010